Tingting Gao

R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning

May 05, 2025
Via arXiv

SeriesBench: A Benchmark for Narrative-Driven Drama Series Understanding

Apr 30, 2025
Via arXiv

VLM as Policy: Common-Law Content Moderation Framework for Short Video Platform

Apr 21, 2025
Via arXiv

InstructEngine: Instruction-driven Text-to-Image Alignment

Apr 14, 2025
Via arXiv

Decoupling Contrastive Decoding: Robust Hallucination Mitigation in Multimodal Large Language Models

Apr 09, 2025
Via arXiv

TIME: Temporal-sensitive Multi-dimensional Instruction Tuning and Benchmarking for Video-LLMs

Mar 13, 2025
Via arXiv

Exo2Ego: Exocentric Knowledge Guided MLLM for Egocentric Video Understanding

Mar 12, 2025
Via arXiv

iMOVE: Instance-Motion-Aware Video Understanding

Feb 18, 2025
Via arXiv

Diffusion Model as a Noise-Aware Latent Reward Model for Step-Level Preference Optimization

Feb 03, 2025
Via arXiv

Kwai-STaR: Transform LLMs into State-Transition Reasoners

Nov 07, 2024
Via arXiv